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Abstract —This paper studies the decentralized quadratic 
cheap talk and signaling game problems when an encoder and 
a decoder, viewed as two decision makers, have misaligned 
objective functions. The main contributions of this study are 
the extension of Crawford and Sobel’s cheap talk formulation 
to multi-dimensional sources and to noisy channel setups. We 
consider both (simultaneous) Nash equilibria and (sequential) 
Stackelberg equilibria. We show that for arbitrary scalar 
sources, in the presence of misalignment, the quantized nature 
of all equilibrium policies holds for Nash equilibria in the sense 
that all Nash equilibria are equivalent to those achieved by 
quantized encoder policies. On the other hand, all Stackelberg 
equilibria policies are fully informative. For multi-dimensional 
setups, unlike the scalar case, Nash equilibrium policies may 
be of non-quantized nature, and even linear. In the noisy 
setup, a Gaussian source is to be transmitted over an additive 
Gaussian channel. The goals of the encoder and the decoder 
are misaligned by a bias term and encoder’s cost also includes 
a penalty term on signal power. Conditions for the existence of 
affine Nash equilibria as well as general informative equilibria 
are presented. For the noisy setup, the only Stackelberg 
equilibrium is the linear equilibrium when the variables are 
scalar. Our findings provide further conditions on when affine 
policies may be optimal in decentralized multi-criteria control 
problems and lead to conditions for the presence of active 
information transmission in strategic environments. 

Index Terms —Information theory, game theory, signaling 
games, cheap talk, quantization. 

I. Introduction 

Team theory is concerned with the interaction dynamics 
among decentralized decision makers with identical ob¬ 
jective functions. On the other hand, game theory deals 
with setups with misaligned objective functions, where each 
player chooses a strategy to maximize its own utility which 
is determined by the joint strategies chosen by all players. In¬ 
formation transmission in team problems is well-understood 
with extensive publications present in the literature; for a 
detailed account we refer the reader to 0. Despite the 
difficulty to obtain solutions under general information struc¬ 
tures, it is evident in team problems that more information 
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provided to any of the decision makers does not hurt the 
system performance and there is a well-defined partial order 
of information structures as studied by Blackwell 12 and 
others. However, for general non-zero sum game problems, 
informational aspects are very challenging to address; more 
information can hurt some or even all of the players in 
a system, see e.g. 0. Further intricacies on informational 
aspects in competitive setups have been discussed in |4|, f5l 
and ®. 

Signaling games and cheap talk are concerned with a 
class of Bayesian games where an informed decision maker 
transmits information to another decision maker. Unlike a 
team setup, however, the goals of the agents are misaligned. 
Such a study has been initiated by Crawford and Sobel 0, 
who obtained the striking result that under some technical 
conditions on the utility functions of the decision makers, 
the cheap talk problem only admits equilibrium policies that 
are essentially quantization policies. This is in significant 
contrast with the case where the utility functions are aligned. 

The cheap talk and signaling game problems find applica¬ 
tions in networked control systems when a communication 
channel/network is present among competitive and non- 
cooperative decision makers GO- For example, in a smart 
grid application, there may be strategic sensors in the 
system 0 that wish to alter the equilibrium decisions at a 
controller receiving data from the sensors to lead to a more 
desirable equilibrium, for example by enforcing an outcome 
to enhance its prolonged use in the system. One may also 
consider a utility company which wishes to inform users 
regarding pricing information; if the utility company and 
the users engage in selfish behaviour, it may be beneficial 
for the utility company to hide certain information and the 
users to be strategic about how they interpret the given 
information. One further area of application is recommender 
systems (as in rating agencies) ESI- All of these applications 
lead to a drastically new framework where the value of 
information and its utilization are very fragile to the system 
under consideration and our study here is an initiator for 
such a general setup. 

Even though in this paper we only consider quadratic 
criteria under a bias term leading to a misalignment, the 
contrast with the case where there is no bias (that has been 
heavily studied in the information theory literature) raises a 
number of sharp conclusions for system designers working 
on networked systems under competitive environments. 

Identifying when optimal policies are linear or affine for 
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decentralized systems involving Gaussian variables under 
quadratic criteria is a recurring problem in control theory, 
starting perhaps from the seminal work of Witsenhausen 
ED, where suboptimality of linear policies for such prob¬ 
lems under non-classical information structures is presented. 
The reader is referred to Chapters 3 and 11 of JT] for 
a detailed discussion on when affine policies are and are 
not optimal. These include the problem of communicating 
a Gaussian source over a Gaussian channel, variations of 
Witsenhausen’s counterexample 1H; and game theoretic 
variations of such problems. For example if the noise vari¬ 
able is viewed as the maximizer and the encoders/decoders 
(or the controllers) act as the minimizer, then affine policies 
may be optimal for a class of settings, see m-m. 
E2 also provides a review on Linear Quadratic Gaussian 
(LQG) problems under nonclassical information including 
Witsenhausen’s counterexample. Our study provides further 
conditions on when affine policies may constitute equilib¬ 
ria for such decentralized quadratic Gaussian optimization 
problems. 

There have been a number of related contributions in 
the economics literature in addition to the seminal work 
by Crawford and Sobel, which we briefly review in the 
following: Reference (l8| shows that even if the sender 
and the receiver have identical preferences, perfect com¬ 
munication may not be possible in an equilibrium because 
information transmission may be costly. Reference ED 
studies the setup in 0 with two senders and shows that 
if senders transmit the messages sequentially once, then 
the equilibrium is always quantized and if senders transmit 
the messages simultaneously and their biases are either 
both positive or both negative, then a fully revealed equi¬ 
librium is possible. Reference E2 studies a scalar setup 
and proves that if multiple senders transmit the messages 
sequentially and their biases have opposite signs, then a fully 
revealed equilibrium is possible; this study also considers 
two-dimensional real valued sources, and shows that a fully 
revealed equilibrium occurs if and only if the multiple 
senders have perfectly opposing biases. Moreover, multi¬ 
dimensional cheap talk with multiple senders is analyzed in 
ED and l22l with unbounded and bounded state spaces, 
respectively. The study in |[23l considers a special noisy 
channel setup between the sender and receiver, and shows 
that there may be infinitely many actions (countable or 
uncountable) induced in an equilibrium even though all 
equilibria are interval partitions in the noiseless case Q. 
Conditions for Nash equilibria are investigated in ll24l for 
a scenario in which there exists a discrete noisy channel 
between an informed sender and an uninformed receiver, 
and the source is finitely valued. Furthermore, there are some 
contributions which modify the information structure given 
in Crawford and Sobel’s setup: In l25l , the sender knows 
that the receiver has partial information about his/her private 
information; whereas the sender does not know this in l26l . 
ED. Reference |28l studies Crawford and Sobel’s setup in a 
finite horizon environment where, in each period, a privately 


informed sender transmits a message and a receiver takes an 
action. For a detailed literature review on communication 
between informed experts and uninformed decision makers, 
we refer the reader to |29l . We note also that in the area of 
information theory, there exists a vast literature on security 
aspects of information transmission, see e.g., GDI, ED- 
Game theoretic analysis is also useful in various contexts 
involving security problems. For example, the security of 
the smart-grid infrastructure can be analyzed by considering 
the adversarial nature of the interaction between an attacker 
and a defender l32l . f33l , and a game theoretic setup would 
be appropriate to analyze such interactions. For an overview 
of security and privacy problems in computer networks that 
are analyzed within a game-theoretic framework, ES can 
be referred. 

In the control community, recently, there have been few 
studies: lf35l considered a Gaussian cheap talk game with 
quadratic cost functions where the analysis considers Stack- 
elberg equilibria, for a class of single- and multi-terminal 
setups and where linear equilibria have been studied. For 
the setup of Crawford and Sobel but when the source 
admits an exponentially distributed real random variable, 
f36l establishes the discrete-nature of equilibria, and obtains 
the equilibrium bins with finite upper bounds on the number 
of bins under any equilibrium in addition to some structural 
results on informative equilibria for general sources. 

A. Contributions 

The main contributions of this study are as follows. 
We prove that for any scalar source, all Nash equilibrium 
policies at the encoder are equivalent to some quantized 
policy, but all Stackelberg equilibrium policies are fully 
informative. That is, there is some information hiding for the 
Nash setup, as opposed to the Stackelberg setup. We show 
that for multi-dimensional setups, however, unlike the scalar 
case, Nash equilibrium policies may be non-quantized and 
can in fact be linear. In the noisy setup, a Gaussian source 
is to be transmitted over an additive Gaussian channel. The 
goals of the encoder and the decoder are misaligned by a 
bias term and encoder’s cost also includes a penalty term 
of the transmitted signal. Conditions for the existence of 
affine equilibrium policies as well as general informative 
Nash equilibria are presented for both the scalar and multi¬ 
dimensional setups. We compare the results with socially 
optimal costs and information theoretic lower bounds, and 
discuss the effects of the bias term on equilibria. Further¬ 
more, we prove that the only equilibrium in the Stackelberg 
noisy setup is the linear equilibrium for the scalar case. 

II. Problem Definition 

Let there be two decision makers (DMs): An encoder (DM 
1) and a decoder (DM 2) as shown in Fig. |T] DM 1 wishes to 
encode the M-valued random variable M to DM 2. Let X 
denote the X-valued random variable which is transmitted 
to DM 2. DM 2, upon receiving X, generates its optimal 
decision U which we also take to be M-valued. We allow 
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for randomized decisions, therefore, we let the policy space 
of DM 1 be the set of all stochastic kernels from M to xQ 
Let L e denote the set of all such policies. We let the policy 
space of DM 2 be the set of all stochastic kernels from X 
to M. Let T d denote the set of all such stochastic kernels. 
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X 
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Fig. 1 : General system model for noiseless case. 

Given 7® £ T® and y d £ l 7 the goal in the classical 
communications theory is to minimize the expectation 

^( 7 -,y)=/c(m,„h-(dx|m)y(d„|x)P(dm), 

where c is some cost function. One very common case is 
the setup with c(m, u) = \m — u\ 2 . 

Recall that a collection of decision makers who have an 
agreement on the probabilistic description of a system and a 
cost function to be minimized, but who may have different 
on-line information is said to be a team (see, e.g. Cl). Hence, 
the classical communications setup may be viewed as a team 
of an encoder and a decoder. 

In many applications (in networked systems, recommen¬ 
dation systems, and applications in economics) the objec¬ 
tives of the encoder and the decoder may not be aligned. 
For example, DM 1 may aim to minimize 

P( 7 ', 7 a ) - /c- ( ™,„) 7 *(d I |m)/(d»|x)P(dm), 

whereas DM 2 may aim to minimize 


pair of policies 7* ; ®, y*' d is said to be a Nash equilibrium 
if 

< J e (i e n* ,d ) V7 e £ r e , 

J d ( 7 *’ e , 7 *’ d ) < J d { 7 *’ e , 7 d ) V 7 d £ T d . 

We note that when c e = c d the setup is a traditional 
communication theoretic setup. If c e = —c d , that is, if the 
setup is a zero-sum game, then an equilibrium is achieved 
when 7 e is non-informative (e.g., a kernel with actions 
statistically independent of the source) and 7“ uses only 
the prior information (since the received information is non- 
informative). We call such an equilibrium a non-informative 
(babbling) equilibrium. The following is a useful observa¬ 
tion, which follows from 0: 

Proposition 2.1: A non-informative (babbling) equilib¬ 
rium always exists for the cheap talk game. 

In the discussion so far, a simultaneous game-play is 
assumed and thus equilibrium refers to a Nash equilibrium. 
Besides the simultaneous game-play, one can also consider 
a sequential game-play; i.e. first the encoder sends the 
message, then the decoder receives it and takes an action 
sequentially while first the encoder’s policy is announced. 
Stackelberg equilibria arise in this case. In the Stackelberg 
game, the encoder announces his coding strategy and since 
the decoder takes an action after receiving the message, 
the encoder knows the optimal action which will be taken 
by the decoder and chooses the message to be transmitted 
accordingly. A pair of policies 7*> e ,7*’ d is said to be a 
Stackelberg equilibrium if 


J e ( 7*7 7*’“(7*’ e )) < J®( 7 e , 7 *’ 77 e )) V 7 e £ T 


where y*’ d fy e ) satisfies 


J d { 7 e ,7 M (7 e ))<-7(7 e ,7 d (7 e )) 


v 7 d £ r d . 


J d (7®,7 d ) = J c d (m,u) r t e (dx\m)'y d (du\x)P(dm). 

In this study, the problems are investigated where the en¬ 
coder and the decoder are deterministic rather than ran¬ 
domized; i.e., 7®(da;|m) = l{/e(m)edx} and 7 d (dit|a;) = 
l{/ d (x)edu} where 1{d} denotes the indicator function of 
an event D, and / e (m) and f d {x) are some deterministic 
functions of the encoder and decoder, respectively. Such 
a problem is known in the economics literature as cheap 
talk (the transmitted signal does not affect the cost, that is 
why the game is named as cheap talk). A more general 
formulation would be the case when the transmitted signal 
is also an explicit part of the cost function c e or c d \ in that 
case, the setup is called a signaling game. We will consider 
a noisy communication setup, where the problem may be 
viewed as a signaling game, rather than cheap talk, later in 
this study. 

Since the goals are not aligned, such a problem is studied 
under the tools and concepts provided by game theory. A 

'Recall that P is a stochastic kernel from M to X if P(jm) is a 
probability measure on B(X) for every m £ M and for every Borel 
A £ Z3(X), P(A | ■) is a Borel measurable function of m. 


Throughout the paper, all equilibrium terms refer to the Nash 
equilibrium unless otherwise stated; it will be separately 
indicated for the Stackelberg game setup and equilibrium. 

Crawford and Sobel 0 have made foundational contribu¬ 
tions to the study of cheap talk with misaligned objectives 
where the cost functions c e and c d satisfy certain monotonic¬ 
ity and differentiability properties but there is a bias term in 
the cost functions. Their result is that the number of bins 
in an equilibrium is upper bounded by a function which is 
negatively correlated to the bias. 

We will first consider the scalar setting by taking the 
cost functions as c e (m, u) = (m — u — b ) 2 and c d (to, u) = 
(to — u ) 2 where b denotes the bias term. The motivation for 
such functions stems from the fields of information theory, 
communication theory and LQG control; for these fields 
quadratic criteria are extremely important. Recall that for the 
case with 6 = 0, the cost functions simply reduce to those 
for a minimum mean-square estimation (MMSE) problem. 

III. Quadratic Cheap Talk 
A. Nash Equilibria in the Scalar Case 

As before, let the cost functions be defined as 

c d (to, u) = (to — u) 2 and c e (to, u) = (to — u — b ) 2 where 
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6 is the bias term. Some existence and deterministic prop¬ 
erties of the equilibrium policies of the encoder and the 
decoder are stated in lf36l and IH Chp.4]. 

Theorem 3.1: lf36l (i) For any 7 e , there exists an optimal 
7 d , which is deterministic, (ii) For any r ) d , any randomized 
encoding policy can be replaced with a deterministic 7 e 
without any loss to DM 1. (iii) Suppose 7 e is an M-cell 
quantizer, then there exists an optimal r y d , which is the 
conditional expectation of the respective bin. 

The following builds on 0 Lem.l], which considers 
sources on [0,1] that admit densities. We note that the 
analysis here applies to arbitrary scalar valued random 
variables. The proofs essentially follow from 0|. 

Theorem 3.2: Let m be a real-valued random variable 
with an arbitrary probability measure. Let the strategy set 
of the encoder (DM 1) consists of the set of all measurable 
(deterministic) functions from M to X. Then, an equilibrium 
encoder policy has to be quantized almost surely, that is, it is 
equivalent to a quantized policy for the encoder in the sense 
that the performance of any equilibrium encoder policy is 
equivalent to the performance of a quantized encoder policy. 
Furthermore, the quantization bins are convex. 

Proof: Let there be an equilibrium in the game (with 
possibly uncountably infinitely many bins, countably many 
bins or finitely many bins). Let two bins be B a and B 1 . 
Also let m a indicate any point in B a ; i.e., m a £ B a . 
Similarly, let m 8 represent any point in B 8 ; i.e., m 8 £ B 8 . 
The decoder chooses action u a = E[to|to £ B a ] when the 
encoder sends m“ £ B a and action u 8 = E[m|m £ B 8 ] 
when the encoder sends rri 8 £ B 8 in order to minimize 
its total cost. Without loss of generality, we can assume 
that u a < u 8 . Let F(m,u ) = (m — u — b) 2 . Because of 
the equilibrium definitions from the view of the encoder; 
F(m a ,u a ) < F(m a ,u 8 ) and F(m 8 ,u 8 ) < F(m 8 ,u a ). 
Hence, 3 to that satisfies F(rn,u a ) = F(m,u 8 ) which 
reduces to 

m =- ^ U + b -+=>■ (m — u a ) = (u 8 — rn) + 26 (1) 

Since F(m+ A, u a ) > F(m+ A, u 8 ) for any A > 0, B 8 and 
{m\m < to} are disjoint sets. Similarly, B a and {to|to > 
to} are disjoint sets, too. Thus, from the definitions of u a 
and u 8 , we have u a < to < u 8 which implies rn — u a >0 
and u 8 -m> 0. Then, from £[}, 

u 8 — u a = ( u 13 — rn) + (to — u a ) = 2 (u 8 — rn) + 26 
> 26 

and 

u 8 — u a = (vP — rn) + (m — u a ) = 2 (to — u a ) — 26 
> -26 

are obtained. Hence, u 8 u n > 2|6|, which implies that 
there must be at least 2|6| distance between the equilibrium 
points (decoder’s actions, centroids of the bins). Further, 
from the encoder’s point of view, given any two bins B a 
and B 8 , there exists a point rn which lies between these 


two bins. This assures that each bin must be a single 
interval; i.e., convex cell except for a possible insignificant 
set of points with measure zero. Since there is an injective 
and monotonic relation between the convex cells of the 
encoder and decoder’s actions, the equilibrium policy must 
be quantized almost surely. ■ 

Recall again that for the case when the source admits 
density on [0,1], Crawford and Sobel established the discrete 
nature of the equilibrium policies. For the case when the 
source is exponential, l36l established the discrete-nature, 
and obtained the equilibrium bins with finite upper bounds 
on the number of bins in any equilibrium. 

B. Stackelberg Equilibria in the Scalar Case 

We will now observe that the Stackelberg setup is less 
interesting. 

Theorem 3.3: The Stackelberg equilibrium is unique and 
corresponds to a fully revealing (fully informative) encoder 
policy. 

Proof: Due to the Stackelberg assumption, the encoder 
knows that the decoder will use ^ d (x) = u = E[to|o;] as an 
optimal decoder policy to minimize its cost. Then the goal 
of the encoder is to minimize the following: 

min E[(m — u — 6) 2 ] = min E[(m — E[?n|x] — 6) 2 ] 

x=7 e (m) x=f e (m) 

= min E[(to — E[to|x]) 2 ] + 6 2 

x=7 e (m) 

= min E[(to - m) 2 ] + 6 2 . 

x=7 e (m) 

Here, the second equality follows from the law of the iterated 
expectations. Since the goal of the decoder is to minimize 
min„ =7 <i( x ) E[(m — u ) 2 ], the goals of the encoder and the 
decoder become essentially the same in the Stackelberg 
game setup, which effectively reduces the game setup to 
a team setup. In the team setup, the equilibrium is fully 
informative; i.e. the encoder reveals all of its information. ■ 

C. Multi-Dimensional Cheap Talk: Nash Equilibria 

Our goal in this subsection is to show that it is possible to 
have linear equilibria in a multi-dimensional quadratic cheap 
talk, unlike the scalar setup. Let the source be uniform on 
[0,1] x [0,1] and the cost function of the encoder be defined 
by c e (rh,u) = ||to — u — 6|| 2 and the cost function of the 
decoder be defined by c d {fh,u) = ||to — m || 2 where the 
lengths of the vectors are defined in +2 norm and 6 is the 
bias vector. For such a scenario, we have the following result. 

Theorem 3.4: An equilibrium policy can be non-discrete 
and even linear. 

Proof: It suffices to provide an example. Consider 6 = 
[0.3 0], Then, as a (properly interpreted) limit case of the 
equilibrium in Fig. [2] the following encoder and decoder 
policies form an equilibrium: 

7 6 (toi, to 2 ) = {xi,x 2 ) = (0, to 2 ) , 

^ d (xi,x 2 ) = (tti, uf) = (0.5, to 2 ) . 
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Here, the scalar setup is applied on the ^-dimension with 
one quantization bin (recall that u\ = E[mi|a;i]), and 
a fully-informative equilibrium exists on the '(/-dimension 
since there is no bias on that dimension. It is observed that 
the encoder policy is linear due to the unbiased property of 
the //-dimension. ■ 


Fig. 2: There is 1 quantization level on the x-dimension 
and 200 quantization levels on the //-dimension. The number 
of quantization levels on the //-dimension can be arbitrarily 
chosen (since b is orthogonal to that dimension). As the 
number of levels goes to infinity, this construction converges 
to the structure of a linear equilibrium. 

Besides linear equilibria, there may be multiple (hence, 
non-unique) quantized equilibria with finite regions in the 
multi-dimensional case as illustrated in Fig. [3] 


in other coordinates. Furthermore, if the encoder only sends 
the 0 variable for the value of the only bin in the coordinate 
for which the bias has nonzero component, then what we 
have is indeed a linear policy. 

D. Multi-Dimensional Cheap Talk: Stackelberg Equilibria 

The Stackelberg equilibria in the multi-dimensional cheap 
talk can be obtained by extending its scalar case; i.e., 
it is unique and corresponds to a fully revealing (fully 
informative) encoder policy as in the scalar case. Thus, 
Theorem 13.31 holds for the multi-dimensional case as well. 

IV. Quadratic Signaling Game: Scalar Case 

The noisy game setup is similar to the noiseless case 
except that there exists an additive Gaussian noise channel 
between the encoder and decoder, as shown in Fig. [4] and 
the encoder has a soft power constraint. 



Fig. 4: General system model for noisy case. 


The encoder (DM 1) encodes a zero-mean Gaussian ran¬ 
dom variable M and sends the real-valued random variable 
X. During the transmission, the zero mean Gaussian noise 
with a variance of <r 2 is added to X ; hence, the decoder (DM 
2) receives Y = X + W. The policy space of DM 1, T e , is 
similarly defined as the policy space in the noiseless case: 
the set of stochastic kernels from R to R (this can be viewed 
as the measurable subset of the space of all product measures 
on R 2 with a fixed input marginal, under the weak topology). 
The policy space of DM 2, r d , is the set of stochastic kernels 
from R to R. The cost functions of the encoder and the 
decoder are also slightly modified as follows: DM 1 aims to 
minimize 
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Fig. 3: Sample equilibria in 2D with b x = 0.1 and b y = 
0.2 where the crosses indicate the centroids of the bins, the 
star indicates the middle point and the square indicates the 
shifted middle point. 

From the discussion above, it can be deduced that if 
b is orthogonal to the basis vectors or satisfies certain 
symmetry conditions, then non-discrete or linear equilibria 
exist. This approach applies also to the n-dimensional setup 
for any n € N. For example, if the bias vector involves 
only one nonzero coordinate component and if the source 
distribution is uniform over an n-dimensional unit cube, then 
full information revelation in all the other coordinates will 
lead to a non-discrete equilibrium. In particular, if nonzero 
component of the bias is greater than 0.25, then there is only 
one bin in that coordinate and the full information is sent 


J e { 7 e ,7 d ) 

c e (m, x, u) r y e (dx\m)"/ d (du\y)P(dy\x)P(dm), 

whereas DM 2 aims to minimize 

J d ( 7 e ,7 d ) 

c d (m, u)'y e (dx\m) r y d (du\y)P(dy\x)P(dm ), 

where P(dy\x) = P(W € dy — x) with W ~ 
J\f( 0, a 2 ). The cost functions are modified as c e (to, x, u ) = 
(to — u — b) 2 + Xx 2 and c d (to, u) = (to — u) 2 . Note that a 
power constraint with an associated multiplier is appended to 
the cost function of the encoder, which corresponds to power 
limitation for transmitters in practice. If A = 0, this corre¬ 
sponds to the setup with no power constraint at the encoder. 
Here, as earlier, the signaling game problem is investigated 
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where the encoder and the decoder are deterministic; i.e., 
7 e (dx|m) = l{ f e( m)edx} and ^ d {du\y) = Iq/^edu} 
where / e (m) and f d (y) are some deterministic functions 
of the encoder and decoder, respectively. 


A. A Supporting Result 

Suppose that there is an equilibrium with an arbitrary 
policy leading to finite (at least two), countably infinite or 
uncountably infinite equilibrium bins. Let two of these bins 
be £>“ and B lj . Also let m a indicate any point in B a ; i.e., 
m a G B a ; and the encoder encodes m“ to x“ and sends 
to the decoder. Similarly, let rrv' represent any point in B :1 ; 
i.e., mP G B^\ and the encoder encodes mP to x& and 
sends to the decoder. Without any loss of generality, we 
can assume that m a < rrv 1 . The decoder chooses the action 
u = E [m\y\ (MMSE rule). Let F(m, x) be the encoder cost 
when message to is encoded as x; i.e., 

F(m, x) = / p (j d ( y ) = u\j e (to) = x ) 

■I y 

x ^(to — u — b) 2 + Xx 2 ^jdy . 

Then the equilibrium definitions from the view of the en¬ 
coder requires F(m a ,x a ) < F(m a ,x /3 ) and F(m^,x^) < 
F(m^,x a ). Now let G(m) = F(m,x a ) — F(m,x /3 ). If 
it can be shown that G(m) is a continuous function of 
to on the interval [to“,to^], then it can be deduced that 
3 to € [to“,to^] such that G(to) = 0 by the Mean Value 
Theorem since G(m“) < 0 and G(mP) > 0. 

Proposition 4.1: G(m) is a continuous function of m on 
the interval [m a ,m^]. 

Proof: It suffices to show that F(m, x) is continuous in 
to. Let {m n } be a sequence which converges to to. Recall 
that (m n — u — b) 2 < 2to 2 + 2(it + b) 2 < oo since m 
is bounded from above and below (to G [m a ,m^]), b is 
a finite bias and E[u 2 ] = E[( 7 d (y)) 2 ] < oo (note that any 
finite cost E[(m — it 2 )] inevitably leads to a finite E[it 2 ] since 
E[m 2 ] = E[(m + it — to) 2 ] < 2E[to 2 ] + 2E[(to — u 2 )] < oo). 
Then, by the dominated convergence theorem, 


lim F(m n ,x) = lim p (j d (y) = (m n ) = x) 

n—f oo n—f oo J^ 1 

x (^{rrin — u — b) 2 + Ax 2 ^dy 

= P(l d {y) = tt|7 e (m) = x) 

Jy 

x ^(to — it — b) 2 + Xx 2 ^j dy = F(m, x) 


which shows the continuity of F(-,x) in the interval 
(m a ,mP). ■ 

From Proposition 14. II 3 to G [to“, to' 3 ] such that G(to) = 
0 which implies F(rn,x a ) = F(rn,x /3 ). Then 


IyP (l d (y) = u\r (m)=x a ) 

x ((to — u — b) 2 + A(a:“) 2 )dy 

= IyP b d (y) = u h e ( m ) = x /3 ) 

x ((to — u — b) 2 + A(a;^) 2 )dy. 


As a result, 

e [( t d (y)) 2 \x j3 } - mi d (y)) 2 \x a ] 

2 (E[ 7 d (y)|x^] - E[ 7 d (y)|a: a ]) 

A ((x ^) 2 — (x Q ) 2 ) 

j _id_ L _i_ LJ. _l h (7) 

2 (E[ 7 d (y)|x^] - E[ 7 d (y)|x a ]) 

is obtained. Recall that the arguments in Theorem l3.2l cannot 
be applied here because of the presence of noise. However, 
when there is noise in a communication channel, the relation 
between E[m|x], E[it 2 |x] and to can be constructed as in (0. 


B. Existence and Uniqueness of Informative Equilibria and 
Affine Equilibria 


We first note that Proposition [2J] is valid also in the noisy 
formulation; i.e. a non-informative (babbling) equilibrium 
is an equilibrium for the noisy signaling game, since the 
appended power constraint is always positive. The following 
holds: 


Theorem 4.1: 

1) Let 0 < A < 


3) 


E[m 2 
E [w 2 


For any b G I, there exists a 


unique informative affine equilibrium. 


2) If A > 


E \m J 


— E[ui 2 ] ' 


there does not exist an informative 


(affine or non-linear or even randomized) equilibrium. 
The only equilibrium is the non-informative one. 

If A = 0, there exists no informative equilibrium with 
affine policies. 


Before presenting the proof, we make the following remark. 


Remark 4.1: The expression defines a quantity 

which determines the Shannon-theoretic capacity of the 
channel given a signal energy constraint at the encoder. This 
can be interpreted as Signal-to-Noise Ratio (SNR) of the 
received signal, which is related to the channel attenuation 
coefficient. If the multiplier of the signal A in the cost 
function is greater than it will not be rational for 

the encoder to send any signal at all under any equilibrium. 

Proof: 

1) If the encoder is linear (affine), the decoder, as an 
MMSE decoder for a Gaussian source over a Gaussian 
channel, is linear (affine); this follows from the prop¬ 
erty of the conditional expectation for jointly Gaussian 
random variables. Suppose on the other hand that the 
decoder is affine so that u = "f d {y) = Ky + L and 
the encoder policy is x = 7 e (m). We will show 
that the encoder is also affine in this case: With 
y = 7 e (m)+w, it follows that u = K'y e (m)+Kw+L. 
By completing the square, the optimal cost of the 
encoder can be written as 


J*’ e = min E[(to — u — b) 2 + Ax 2 ] 

x—"Y e (m) 


= mm 

7 e (m) 


(m — L — b)K \ 2 


+ 


X (E[rn 2 ] + (L + 6 ) 2 ) + AT 2 E[ii; 2 ]. 


)' 


K 2 + A' 
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Hence, the optimal 7 e (m) can be chosen as 

(m — L — b)K (m — L — b ) 

7 ’ ^ “ AT 2 + A “ K + X/K 


(3) 


and the minimum encoder cost is obtained as 

J *’ e = K^xi E[m2] + {L + b)2 ) + r2e[w2] ■ 

(4) 


Recall that (0 implies that an optimal encoder policy 
for a Gaussian source over a Gaussian channel is an 
affine policy if the decoder policy is chosen as affine. 
We now wish to see if these sets of policies satisfy 
a fixed point equation. If the decoder has an affine 
policy, it is proved that the optimal policy of the 
encoder is also affine : 

7 -(m) = Am + C = (jjHyjf ) m + (jj 2 ^) 

(5) 


On the other hand, with the given affine encoding 
policy x = 7 ® (to) = Am + C, the optimal decoder 
policy would be 


7 d {y) = Ky + L = 


HE [to 2 


A 2 E[m 2 }+E[w 2 j y C) ' (6) 

By combining these, we obtain ( K 2 + A) 2 E[ur] = 
AE[to 2 ] by assuming A ^ 0; which implies K 2 = 
yj ^ — A. If we combine the equations above by 
using A, and define the resulting mapping as T(H), 
we obtain 


A = 


A 2 +E[w 2 ] /E[m 2 ] 


A 2 +E[w 2 ] /E[m 2 


r + A 


= T(A ). (7) 


Note now that 

A > 1 


A 


A 2 


E[u ; 2 

E\m? 


< 1 


T(A) < -, 


A < 1 


A 2 


V[w 2 

Elm? 


< 


E[?77 2 

E[u > 2 


T(A) < 


E[m 2 

— 


which implies that the mapping defined by T(H) = A 
can be viewed as a continuous function mapping the 
compact convex set [0, max(E[TO 2 ]/E[u; 2 ], 1)/A] to 
itself. Therefore, by Brouwer’s fixed point theorem 
ED, there exists A = T(H). Indeed, we can find 
nonzero A for every 0 < A < J 0 After finding 
A, the values for K, C and L can also be obtained 
based on the equilibrium equations in 0 and 0 . For 
the uniqueness of an informative fixed point, suppose 
that there are two different nonzero fixed points: A\ = 


2 Recall that if A A 0 and 0 < A < 2 7%: , we have K 2 = , / 1 _ 

' K[w z \ y ¥j[w z \ 

A, which implies A = = ±^Jyj j - f[^. 


T(Ai) and A 2 = T(H_ 2 ) and let 7 = E[w 2 ]/E[to 2 ] for 
simplicity. Then Ai/T(A{) = H_ 2 /T(H 2 ) implies 


A 2 


A 2 


+ A(H 2 +7) = 


All 




Al 


(H 2 + 7)(H 2 +7) 


+ ^(^2 + 7 ) 


+ a) = 0 . 


Hence, |Hi| = |H. 2 | is obtained, and since the map¬ 
ping is defined from [0, max(E[? 77 2 ]/E[u; 2 ], 1)/A] to 
itself, the nonzero fixed point is unique. Then the 
encoder may choose the nonzero fixed point for the 
informative equilibirum if it results in a lower cost 
than the non-informative equilibrium (due to the cost 
of communication, an informative equilibrium is not 
always beneficial to the encoder compared to the non- 
informative one). 

2) Let A > E[to 2 ]/E[to 2 ] and suppose that we are in an 
equilibrium. Then, the encoder cost J e = E[(m — u - 
b) 2 + \x 2 ] reduces to J e = E[(to — u ) 2 ] + AE[a; 2 ] + b 2 
since the decoder in an equilibrium always chooses 
u = E[to|j/]. Through P = E[a; 2 ], the following 
analysis leads to a lower bound on the encoder cost: 


J e = b 2 + AE[ar] + E[(m — u) 2 ) 

> b 2 + XP + E[m 2 ]e -2 sup I ( X ' Y '> 


= b 2 + A P + E[TO 2 ]e 


— 2 2 log 




= b 2 + XP + 


E[r 


( 8 ) 


1 + P/E[w 2 } ' 

Here, (a) follows from a rate-distortion theoretic bound 
through the data-processing inequality (see for ex¬ 
ample p. 96 of HD). However, it follows that when 
A > E[m 2 ]/E[ui 2 ], © is minimized at P = 0; that 
is, the encoder does not signal any output. Hence, the 
encoder engages in a non-informative equilibrium and 
the minimum cost becomes E[to 2 ] + b 2 at this non- 
informative equilibrium. 

3) It is proved that an optimal encoder is affine such that 
x = 7 ® (to) = Am+C when the decoder is affine, that 
is, u = 7 d (y) = Ky + L. Then, by inserting A = 0 to 
0 , to is obtained asm = KA ^ m ^ +KC+L+b. 

This holds for all to“ and with to“ < to < mP. 
Thus, if the distance between m a and to 13 is made 
arbitrarily small, then it must be that KA = 1 and 
KC + L + b = 0. On the other hand, it was shown 
that an optimal decoder policy is affine if an encoder 
is affine in ©. By combining KA = 1 and K = 
A%[^|+E[tH] ’ il f°ll° ws that a real-valued solution 
does not exist for any given affine coding parameter. 


Remark 4.2: Note that, from 0 and 0 , we have A = 

K+x/K' K = A a E[m^+E[«j a ]’ ^ = and Ab = (HA - 

\)C. From these equalities, we observe the following: 

1) when A = 0, it is shown in Theorem 14. II that there is 
not any fixed point solution to 0 . However, if there 





































is not a noisy channel between the encoder and the 
decoder; i.e., the noise variance is zero (E[w 2 ] = 0), 
then {7} has a fixed point solution. Even when £7} 
has a fixed point solution A, ([7} and 0 cannot hold 
together unless 6 = 0. 

2) when the noise variance is zero (E[u; 2 ] = 0), there 
is not any fixed point solution to 0 unless A = 0. 
Even when 0 has a fixed point solution A, (0 and 
0 cannot hold together unless 6 = 0. 

3) when A = 0 and the noise variance is zero (E[w 2 ] = 
0); the consistency of 0 and 0 can be satisfied if 
only if 6 = 0. Hence, if 6 ^ 0, there cannot be a affine 
equilibrium; the equilibrium has to be discrete due to 
Theorem 13.21 

Thus, if either A or E[w 2 ] is 0, an affine equilibrium exists 
only if A, E[w 2 ] and 6 are all 0. 


C. Price of Anarchy and Comparison with Socially Optimal 
Cost 

In a game theoretic setup, the encoder and the decoder try 
to minimize their individual costs, thus the game theoretic 
cost can be found as min 7 « J ,: -fniin,,./ J d . If the encoder and 
the decoder work together to minimize the total cost, then the 
problem can be regarded as a team problem and the resulting 
cost is a socially optimal cost, which is min 7 e j7 d(J e + J d ). 
In the game theoretic setup, because of the selfish behavior 
of the players, there is some loss from the socially optimal 
cost, and this loss is measured by the ratio between the 
game theoretic cost and the socially optimal cost, which was 
proposed as a price of anarchy (38). In this part, it will be 
shown that the game theoretic cost is higher than the socially 
optimal cost as expected, and the information theoretic lower 
bounds on the costs and their achievability will be discussed. 


Theorem 4.2: 

1) Let g, and g u represent the informative and the 
non-informative equilibrium game c osts, re spectively. 

US' - «V] 


Then, gi = 3y / AE[m 2 ]E[?u 2 ] + 6 2 
and g u = 2E[m 2 ] + 6 2 . Further, the total cost in the 
game equilibrium is the following 


J*’ 9 = 


[ min{ 5 i, g u } A < E[m 2 ]/E[w 2 ] 
I 9u A > E[m 2 ]/E[w 2 ] 


2) Let ti and t u represent the informative and the 
non-informative team costs, respectively. Then, ti = 
2-v/2AE[m 2 ]E[u; 2 ] + — AE[u> 2 ] and t u = 2E[m 2 ] + 

Y- Further, the socially optimal cost (the total cost in 
the team setup) is the following 

^min{fj, t u } A < 2E[m 2 ]/E[w 2 ] 
t u A > 2E[m 2 ]/E[w 2 ] 


J*’* = 


Proof: 

1) Note from 0 and 0 that we have A = K+ \/ K » 

K = L = - KC and Ab = °( AK - 


1). Also we have ( K 2 + A) 2 E[u> 2 ] = AE[?tt 2 ] which 
implies K 2 = e^I^ and ^ < E[to 2 ]/E[u; 2 ] for 
nonzero A. Recall that if A > E[m 2 ]/E[u> 2 ], then A = 
C = K = L = 0, which implies the non-existence of 
the informative linear (also affine) equilibrium. Thus, 
for A < 

l 

K+X/K 


E[m 2 l , . jv-2 / AEfm 2 ] \ 

ipf by usin S K = V^KT - A ’ A = 

, C = and L + b = —§ in 0, we have 


J*' e = 2 v / AE[m 2 ]E[w 2 ] + 6 2 


I E[m 2 
AE[w 2 


- AE [w 2 ]. 


Now recall that the optimal decoder policy is u* = 
E[m\(y = Am+C+w )] = ^ 2 E ff (y-C), and 

we have o 2 = <r 2 -^ where e = x — E[a;|j/]. In this 

case, x —> m, y —>• y, a 2 —>• E[m 2 ], a xy —> AE[m 2 ] 
and a 2 —> A 2 E[m 2 ] + E[ru 2 ]. Thus, we have 


J*’ d = min E[(m — u) 2 ] = E[(m — E[m|y]) 2 ] 


«=t d (v) 


- ® m 


7 2 

my 


_ r „ A 2 (E[m 2 ]) 2 


= E[m 2 ] - 


A 2 E[m 2 ] + E[w 2 


= xj AE[m 2 ]E[w 2 ]. 


As a result, the game theoretic cost at the equilibrium 
is found as 

J*’S = 3 V / aemekI + y - AE[w 2 ] . 

(9) 

Recall that, if A > E[m 2 ]/E[w 2 ], then J*’ e = E[m 2 ] + 
6 2 and J*’ d = E[m 2 ]; hence, J*’ 9 = 2E[m 2 ] + 6 2 . If 
there were no cost of communication (consider the 
cheap talk; i.e., remove Aa: 2 from the encoder cost 
function), then one could say that the informative 
equilibria would always be beneficial to both the 
encoder and the decoder; however, due to the cost 
of communication, an informative equilibrium is not 
always beneficial to the encoder when compared with 
the non-informative one (i.e., for A < E[m 2 ]/E[u; 2 ], 
it does not always hold that 2 yj AE[m 2 ]E[w 2 ] + 

6 2 \j ae|iP] ' — AE[w; 2 ] < E[m 2 ] + 6 2 ). For the re¬ 
ceiver, however, information never hurts the perfor¬ 
mance and the informative equilibria are more de¬ 
sirable (i.e., for A < E[m 2 ]/E[w 2 ], the inequality 
y / AE[m 2 ]E[w 2 ] < E[t7i 2 ] always holds). As a result, 
one can expect a non-informative equilibrium even if 
A < E[m 2 ]/E[iu 2 ]. 

2) The part below aims to construct the socially optimal 
affine setup. In this part, J e,t represents the team 
cost minimized over the encoder policies for a given 
decoder policy, J d t represents the team cost mini¬ 
mized over the decoder policies for a given encoder 
policy, and J* d represents the optimum team cost; i.e., 
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minimization over all affine encoding and decoding 
policies as follows: 

J *’ 4 = min E[(m— u—b) 2 +Xx 2 +(m—u) 2 ]. 

x=7 e (m) ,u=~/ d (y) 

Similar to the game theoretic analysis above, with the 
given affine encoding policy x = 7 e (m) = Am + C 
(then y = x + w = Am + C + w ), the optimal decoder 
policy can be found as follows (by completing the 
square): 

J d,t = min E[(m — u — b) 2 + Ax 2 + (to — u) 2 } 
«=t d ( v ) 

r h h 2 r 2 1 

= min 2E (to — u -) 2 H- 1 - A— . 

L v 2 4 2 J 

Hence the optimal decoder policy can be chosen as 
7 d,t (y) = E[m — 11 y\. Due to the joint Gaussanity of 
to and y, the minimizer decoder policy is affine: 


Here, either AK = 1 or C = 0. If AK = 1, 
then E[u> 2 ] = 0 which contradicts with the noise 
assumption. Then C = 0 and L = —6/2. By using 
the equalities for A and K above, one can obtain 
2 (K 2 + A/2) 2 E[id 2 ] = AE[m 2 ] by assuming 4 / 0; 
which implies K 2 = yj ^(7^ — Since K 2 is 

positive, A cannot be greater than otherwise, 

because of our assumption, A must be equal to 0 
which implies that K = 0, and there does not 
exist an informative affine team setup. Then K 2 = 

\Z a e[™°| ~ f anc * ^ < 2E[m 2 ]/E[w 2 ] for nonzero A. 


Thus, for A < by using K 2 = yj - 

H = 9/ /|Y x . (7 = 0 and L = — | in (fl2l) . we have 


A 
2 ’ 


J*’* = 2 v / 2AE[m 2 ]E[w 2 ] + — 


AE[w 2 ]m. (13) 


7 d,t (y) = Ky + L 


AE[m 2 ] 

A 2 E[m 2 ] + E[w 2 ] 


(y-c) 


b 

~ 2 ' 

( 10 ) 


Similar to the game theoretic analysis above, for 
any affine decoder policy 7 d (y) = Ky + L with 
y = 7 e (m) + ui, the optimal encoder policy for the 
team setup can be obtained as follows (by completing 
the square): 


J e,t = min E[(m — u — b) 2 + Ax 2 + (m — u ) 2 ] 

x=7 e (m) 

( 2 m — 2 L — b)I\\ 2 


in (2K 2 + A)E [ [^(m) 
m) L V 


2K 2 + A 


■ 2K 2 E[w 2 


= mm 

7 e (m) 

6 2 iT 2 + A (2E[m 2 ] + (i + 6) 2 + L 2 ) 

+ 2 K 2 + A 

Hence, the optimal encoder 7 e (m) is 

7 e ’*(m) = im + C = ( 2m ~ 2L ~ ^ , (11) 

' y ’ 2K + A/K ’ v ; 

and the minimum team cost is obtained as 

b 2 K 2 + A (2E[to 2 ] + (L + b) 2 + L 2 ) 


J *’ 4 = 


2A ' 2 + A 


■ 2K 2 E\v 


( 12 ) 


This implies that, in the team setup, an optimal en¬ 
coder policy for a Gaussian source over a Gaussian 
channel is a affine policy if the decoder policy is 
chosen as affine. 

In order to achieve the socially optimal cost J* d , 
the optimal encoder policy 7 e , 4 (m) and the optimal 
decoder policy 7 d ^(y) must satify the following 
equalities by (flOt and ( 1 1 I [ 1 : 

a 2 HE [to 2 ] 

2K + A/K ’ H 2 E[m 2 ] + E[w 2 ] ’ 

A Ah h 

C = —{—2L — b) = —AL — , L = —KC — — 

*C=-A(-KC-ty-%=AKC. 


Recall that, if A > 2 E[to 2 ]/E[w 2 ], then J* d = 
2E[m 2 ] + Similar to the game theoretic setup, due 
to the cost of the communication, the encoder and the 
decoder may prefer the non-informative equilibrium 
over the informative one (if 2 y / 2AE[to 2 ]E[w; 2 ] + — 
AE[u> 2 ] > 2 E[to 2 ] + ^-). 


Theorem 4.3: The price of anarchy is always larger than 
1, i.e., the sum of the costs under any Nash equilibria is 
always larger than the socially optimal cost. 

Proof: By Theorem 14.21 we have the following 


I min {gi, g u } A < E[to 2 ]/E[w 2 ] 
[g u A>E[m 2 ]/E[w 2 } 

min{fj, t u } A < 2E[to 2 ]/E[w 2 ] 
t u A > 2E[to 2 ]/E[w 2 ] 


Notice that we have f < gi for A < E[to 2 ]/E[w 2 ] and 
t u < gu always. Consider the following cases: 

1) 0 < A < E[m 2 ]/E[u! 2 ] : There are four cases to be 
considered: 

a) minjffi, g u ) = and minjfj, t u } = f: Since 
ti < gi, J* d < J*’ 9 is satisfied. 

b) minjgi, g u } = gt, and min{fj, t u } = t u : Since 
t u < U < gi, J* d < J*’ 9 is satisfied. 

c) minjgi, g u } = g u andminj^, t u } = U: Since 
t% < t u < g u < gu •/*'* < J*' 9 is satisfied. 

d) minjga, g u } = g u and minjfj, t u } = t u : Since 
t u < gu, J*'* < J*’ 9 is satisfied. 

2) E[to 2 ]/E[w 2 ] < A < 2E[to 2 ]/E[u! 2 ] : There are two 
cases to be considered: 

a) mm{tj, t u } = L: Since U < t u < g u , J*’ f < 
J*’ 9 is satisfied. 

b) min{fj, t u ) = t u : Since t u < g u , J* d < J*' 9 is 
satisfied. 

3) A > 2E[to 2 ]/E[w 2 ] : Since t u < g u , J* d < J*’ 9 is 
satisfied. 
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Hence, one can observe that J*’ 9 > J*’ 4 always holds, 
which shows that the price of anarchy is greater than 1, i.e., 
the game theoretic cost is always larger than the socially 
optimal cost. ■ 

In the following, we discuss information theoretic lower 
bounds on the performance of equilibria and socially optimal 
strategies. 

Theorem 4.4: 

1) For the game setup, if A > (he., non-informative 

equilibria), the information theoretic lower bounds on 
the costs are achievable. 

2) For the game setup, if A < and 6 = 0, then the 
information theoretic lower bounds on the costs are 
achievable by linear policies. 

3) For the game setup, if A < and 6/0, the 

information theoretic lower bounds on the costs are 
not achievable by affine policies. 

4) For the team setup, the information theoretic lower 
bounds on the costs are always (both in the informative 
and non-informative equilibria) achievable by affine 
policies. 


Proof: 

1) Recall that the encoder cost is J e = E[(m — u — 
6) 2 + Aa; 2 ] and we know that this reduces to J e = 
E[(m — u) 2 ] + AE[x 2 ] + 6 2 since the decoder always 
chooses u = E[m|y]. From ®, we have a bound 


E [m* 


where 


on the encoder cost J e > b 2 + A P + 1+ p/ E [ w 2 ] 

P = E[x 2 ] represents the power. This bound is 
tight when the encoder and the decoder use linear 
policies leading to jointly Gaussian random variables. 
For A < E[to 2 ]/E[u< 2 ], a minimizer of this cost is 

P* = ,y/ E [ M ini— E[w 2 ]. If we insert this value into 
®, we have J e > 2^/ XE[m 2 ]E\w 2 ] + 6 2 — AE[w 2 ]. 
By the same reasoning above, we also have J d = 

E[(m — it) 2 ] > 


E[n 


1 +- 


> \J AE[m 2 ]E[u; 2 ]. Hence, 

the information theoretic lower bound on the game 
cost J 9 = J e + J d is found as 


J 9 > 3a/AE[to 2 ]E|w 2 ] + b 2 - AE[w 2 ]. (14) 

Through an analysis similar to the one in ®, one can 
see that when A > E[to 2 ]/E[u> 2 ], © is minimized at 
P = 0 (the encoder does not signal any output); thus 
we obtain a non-informative equilibrium: The encoder 
and the decoder do not engage in communications; i.e., 
A = 0 and K = 0 is an equilibrium. In this case the 
encoder may be considered to be linear, but this is a 
degenerate coding policy. This implies J 9 > 2E[m 2 ] + 
6 2 , and remember that J*’ 9 = 2E[to 2 ] + b 2 when A > 
E[m 2 ]/E[u> 2 ], hence the information theoretic lower 
bound is achievable in the non-informative equilibria. 

2) From ® and (ITdl) . it can be deduced that when 6 = 0, 
the lower bound of the encoder cost is achievable by 
linear policies; i.e., C = 0 and L = 0. When 6 = 0, 
the problem corresponds to what is known as a soft- 
constrained version of the quadratic signaling problem 


where we append the constraint to the cost functional 
(see page 96 of ID). 

3) If 6 / 0, then, from © and (IT4l) . one can observe 
that the lower bound becomes unachievable by affine 
policies since the power constraint related part of 
the cost function, \x 2 , contains 6 2 related parameters 
(recall C = ak-i )• this case, by modifying the 
power from P to P — C 2 (which must be posi¬ 
tive) in the information theoretic inequalities; i.e., 
J e > b 2 + XP + 1+(P E ^y E r w a] . then the minimum 
game cost is obtained as J 9 > 3a/AE[ m 2 ]E[w 2 ] + 
b 2 \J ~ AE[ic 2 ] which is the same cost that is 
achieved by affine policies. 

4) By following a similar approach to © for finding 
the lower bound on the socially optimal cost, we can 
obtain: 


J 4 = E[(m — u — b) 2 + Xx 2 + (to — u) 2 ] 


_ b 2 
~ 2 

(“) b 2 

> — 

~ 2 


AE[a; 2 


AP 


-2E 


TO — U — 


2E [to 2 


1 + P/E[w 2 } 


Here (a) holds since the decoder chooses u = E[m — 
-1 y] and shifting does not affect the differential en¬ 
tropy. Similar to the previ ous analysis , a minimizer 

of this cost is P* = _ E[ w 2 ] for 

A < 2E[to 2 ]/E[w 2 ]. If we insert this value into the 
total cost, we have 


__ h 2 

J 4 > 2v / 2AE[m 2 ]E[w 2 ] + — - XE[w 2 ]. (15) 

Recall that, if A > 2 E[to 2 ]/E[w 2 ], then P = 0 
becomes the minimizer, hence J 4 > 2E[m 2 ] + 
in the non-informative equilibrium. Remember that 
J*’ 4 = 2E[m 2 ] + y in this case, thus the informa¬ 
tion theoretic lower bound is achievable in the non- 
informative equilibria. In addition, from (fT3l> and ( 115k 
for A < 2 E[to 2 ]/E[u> 2 ] (which implies the informative 
equilibria), it can easily be seen that the information 
theoretic lower bound is achievable by affine policies 
(actually the encoder policy is linear and the decoder 
policy is affine). ■ 

We state the following summary. 

1) If A < E[m 2 ]/E[w; 2 ] and 6 = 0, then the information 
theoretic lower bound on the game cost is achievable 
by the linear policies. 

2) If A < E[m 2 ]/E[u; 2 ] and 6/0, then the information 
theoretic lower bounds on the game cost are not 
achievable by the affine policies; but they become 
achievable after slight modification on the power pa¬ 
rameter in the information theoretic inequality. 

3) The team cost J*’ 4 in the affine equilibrium is always 
equal to the information theoretic lower bound on the 
team cost. 
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4) The price of anarchy is always greater than 1: The 
socially optimal cost is always lower than the cost in 
any equilibrium. 

5) In the game setup, the non-informative equilibrium 
may be preferred over the informative equilibrium by 
the encoder due to the cost of the signal Ax 2 . 


the n-dimensional zero-mean Gaussian noise with the co- 
variance matrix E ( j, is added to X and the decoder (DM 
2) receives Y = X + W. The policy space of DM 1, T e , 
and the policy space of DM 2, T d , are the set of stochastic 
kernels from R” to R”. The cost functions of the encoder 
and the decoder are as follows: DM 1 aims to minimize 


D. Stackelberg Setup 

If we consider the Stackelberg setup of the signaling game 
problem studied in this section; i.e. the encoder knows the 
policy of the decoder, then it can be shown that the only 
equilibrium is the linear equilibrium. 

Theorem 4.5: The only equilibrium in the Stackelberg 
setup of the signaling game is the linear equilibrium. 

Proof: In the proof, first we assume the linear encoding 
policy and show that the information theoretic lower bound 
is achieved, then we conclude that the encoder policy must 
be linear. Let the encoder policy be x = 7 e (m) = Am + C. 
Due to the Stackelberg assumption, the encoder knows 
that the decoder will use 7 d {y) = u = E[m|y] as an 
optimal decoder policy to minimize the decoder cost, thus 

u = ^ d (y) = ( y ~ c ) where v = Am+c+w. 

Then the goal of the encoder is to minimize the following: 


J*’ e = 


= min E 

A, C 


min E[(to — u — b) 2 + Ax 2 ] 
x=7 e ( m)=Am+C 

mE[w 2 ] — AE[m 2 ]w 


2 ] + E[w 2 

2'mG’L,,2l'|2 , A 2/itfr vv ,2i\ 2 


A 2 E[m^ 


= mm 

A, C 


-bj + X{Am + C) 2 

E[m 2 ](E[w 2 ]) 2 + j4 2 (E[to 2 ]) 2 E[u! 2 ] 2 

(A 2 E[m 2 ] + E[w 2 ]) 2 + 

+ XA 2 E[m 2 } + AC 2 


E[m 2 ]E[u> 2 ] 2 2 2 2 

— mm „ — ^ — 7T7 + b + A A E[m ] + A C“ . 
A, c A 2 E[m 2 ] + E[w 2 ] J 


(16) 


The optimal encoder cost in (HI is achieved for C* = 
0 and A* = 0 for A > E[m 2 ]/E[w 2 ] and A* = 

\j\]w §+-33 for A < K[rn 2 ]/E[w 2 }. Then the op¬ 
timal encoder cost is obtained as J* ,e = E[m 2 ] +b 2 for A > 
E[to 2 ]/E[w 2 ] and J*’ e = 2^/XE[m 2 ]E[w 2 } + b 2 - AE[w 2 ] 
for A < E[m 2 ]/E[ui 2 ]. Note that these are the information 
theoretic lower bounds in the proof of the first part of 
Theorem 14.41 and these lower bounds are achieved when the 
encoder and the decoder use linear policies jointly, which is 
valid for the current case. ■ 


V. Quadratic Signaling Game: 

Multi-Dimensional Gaussian Noisy Case 

The scalar setup considered in Section[IV]can be extended 
to the multi-dimensional Gaussian noisy signaling game 
problem setup as follows. The encoder (DM 1) encodes 
an n-dimensional zero-mean Gaussian random variable M 
with the covariance matrix E ^ and sends the real-valued 
n-dimensional random variable X. During the transmission, 


J e { 7 e ,7 d ) 

= J c e (fh,x,u)'y e (dx\rh)'y d (du\y)P(dy\x)P(dm), 

whereas DM 2 aims to minimize 

J d { 7 e ,7 d ) 

= J c d (m,u)^ e (dx\m)"f d (du\y)P(dy\x)P(drh), 

where P(dy\x) = P{W G dy — x) with W ~ JV(0, E^). 
The cost functions are c e (m, x, u) = \\fh — u — 6|| 2 + A||x || 2 
and c d (m, u) = ||m — n || 2 where the lengths of the vectors 
are defined in L 2 norm and b is the bias vector.. Note that 
we have appended a power constraint and an associated 
multiplier. If A = 0, this corresponds to the setup with no 
power constraint at the encoder. 


A. Affine Equilibria 

Theorem 5.1: 

1) If the encoder is linear (affine), the decoder, as an 
MMSE decoder for a Gaussian source over a Gaussian 
channel, is linear (affine). 

2) If the decoder is linear (affine), then an optimal en¬ 
coder policy for a multi-dimensional Gaussian source 
over a multi-dimensional Gaussian channel is an affine 
policy. 

3) An equilibrium encoder policy 7 e (m) = 

Am, + C satisfies the equation A = T(A) 
where T{A) = ( FF T + A/) 1 F and 

F = (AEjgA T + E^,) AEjtf. 

4) There exists at least one equilibrium. 

Proof: 

1) Let the affine encoding policy be x = 7 e (m) = Am + 

C where A is an n x n matrix and C is an n x 1 vector. 
Then y = x+w = Am+C+w. The optimal cost of the 
decoder, by the law of the iterated expectations, can 
be expressed as J*’ d = min^^d^) E [j|m — 'u|| 2 |y]. 
Hence, a minimizer policy of the decoder is u = 
7 *’ d {ff) = E Since both m and y are Gaussian, 

then the optimal decoder is 

E [m\y\ = E [m\ + E^-.E A\(y - E[$) 

= E a A T (AE a A T + E^y\y^C). ( 17 ) 

2) Let the affine decoding policy be u = 7 d (y) = Ky+L 
where K is an n x n matrix and L is an n x 1 vector. 
Then u = Ky+L = K{x+w)+L = Kj e (m)+K'w+ 
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L. By using the completion of the squares method, the 
optimal cost is 


J*’ e = ^ min_ E 

x=f e (m) 

= E 


= E 


min E 

x='y e (m) 


min E 

7 e (m) 


ra — u — b\\ 2 + A||tc || 2 
m — u — b\\ 2 + A||cc || 2 

({K t K + A/) 7 e (m) - K 


m 


x (to — L — 6 )^ ^ K t K + A I^j 
x (( K t K + A/) 7 e (m) - K T {m - L-b )) 


+ (to - L - bj T (i - K(K t K + XI)- 1 1 < r ) 


(ih-t-b) 

fh 

+ E 

iu t K t Kw 

V / 



L J 


Hence, the optimal 7 e (m) can be chosen as follows: 

7* ,e (m) =Am + C= (/T T iT + A/) 1 K T 

x (m-L-by (18) 

3) We have K = ^^A T (AT,^A t + E ^) _1 and A = 
(^K t K + Xlj K t from (fTTI i and (flSl i. By combin¬ 
ing these, A = T(A ) = (FF t + A/) 1 F can be 
obtained. 

4) Since FF T is a real and symmetric matrix, it is 
diagonalizable and can be written as FF T = QTQ~ X 
for a diagonal T. Now consider ||T(A)||^ where || • ||f 
denotes the Frobenius norm: 


\\T(A)\\ f =tr(^(FF T + \l) V) 

x ((FF T + A/) _ 1 F) T j 


= tr (T + A/) _i T(T + XI) 


= £ 


\-i 


. . [vi + A ) 2 

1=1 


(19) 


where Vi, i = 1 ,,n are the eigenvalues of FF T 
and since FF T is positive semi-definite, all these 
eigenvalues are nonnegative. Since A > 0, we observe 
the following: 


iH <E [0,1] 
Vi G (1, 00 ) 


Vi 


< 


1 


(Vi + A ) 2 A 2 ’ 

Vi < ^ = 1 < 1 , 
('Vi + A ) 2 vf Vi 


Hence, Vi/(vi + X) 2 < max(l,l/A 2 ) always holds. 
Then, by ( fT9l >. we have ||T(A)||p’ < nmax(l,l/A 2 ), 
which implies that T(A) can be viewed as a continu¬ 
ous function mapping the compact convex set 114111^ G 


[0,nmax(l, 1/A 2 )] to itself. Therefore, by Brouwer’s 
fixed point theorem |3_7], there exists A = T(A). ■ 

We note, however, that there always exist a non-informative 
equilibrium (see Proposition 12.11 which also applies to the 
signaling game discussed in this section). However, there 
exist games with informative affine equilibria as we state in 
the following (see Theorem 15.2b . 

Proposition 5.1: If either A or is zero, an informative 
affine equilibrium exists only if A, and b are all zero. 

Proof: Note that, from (ITT] ) and (fl 8 ]> . we have A = 
(l< T K + A/)^V t , K = E v j/l r (/IS v ;.4 r + E h >) -1 , 

L = —KC and Ab = (AK — I)C. From these equalities, 
we can analyze the equilibrium as in the scalar case: 

1) when A = 0 and the noise is zero (E ^ = 0), then 
A = R- 1 and K = A~ x are obtained. Then Ab = 
(AK — I)C = 0 , thus the consistency of the equalities 
can be satisfied if only if b = 0. Hence, if b f 0, there 
cannot exist an informative affine equilibrium. Recall 
that in the multi-dimensional noiseless cheap talk, the 
linearity of the equilibrium is shown for the uniform 
source; here the source is Gaussian. 

2) when A = 0, then A = K~ l and AY,^A T + E^ = 
K~^mA T are obtained. There does not exist a 
solution to 0 unless the noise is zero (E^ = 0 ). 
Even when 0 has a fixed point solution A, 0 and 
0 cannot hold together unless b = 0 . 

3) when the noise is zero (E^ = 0), then K = A~ x and 

K t K + XI = K t A~ 1 are obtained. There does not 
exist a solution to 0 unless A = 0. Even when © 
has a fixed point solution A, 0 and 0 cannot hold 
together unless b = 0 . ■ 

Remark 5.1: In the multi-dimensional case, fixed points 
may not be unique: with A = 1.0311 and 


1.6421 

0.1299 

0.5713 

0.2305 

0.1299 

1.4803 

0.6810 

0.4749 

0.5713 

0.6810 

1.7312 

0.4292 

0.2305 

0.4749 

0.4292 

1.3515 

1.2742 

0.1868 

0.2318 

0.0559 

0.1868 

1.8266 

0.5955 

0.3091 

0.2318 

0.5955 

1.2377 

0.4951 

0.0559 

0.3091 

0.4951 

1.5336 


we can obtain two fixed points with different absolute-valued 
elements as follows (recall that if A is a fixed point, —A is 


also a fixed point): 





—0.1543 

0.1762 

0.0606 

0.1117 

A = 

0.1602 

0.0159 

0.1036 

0.0279 

- 0.2000 

-0.1879 

-0.2700 

-0.1565 


_ 0.0603 

0.1052 

0.1221 

0.0824 


—0.2431 

0.0738 

-0.0752 

0.0285 

A = 

0.0293 

-0.1351 

-0.0966 

-0.0948 

0.1520 

0.2181 

0.2682 

0.1735 


-0.1003 

-0.0801 

-0.1236 

-0.0683 
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Theorem 5.2: Let source M be a zero-mean n- 
dimensional Gaussian random variable with covariance ma¬ 
trix = diag{a ^ ni ,..., } where diag indicates a 

diagonal matrix, and noise W be a zero-mean n-dimensional 
Gaussian random variable with covariance matrix E^ = 

diag{a%, ,, <J 2 . }. Then an informative affine equilibrium 
1 n 0-2 a 2 
exists if A < maxj—.... 

*- <7^^ 7 1 (T£j j 

Proof: Since the source components are indepen¬ 
dent and the noise components are independent, the n- 
dimensional noisy signaling game problem turns into n in¬ 
dependent scalar noisy signaling game problems as follows: 

1) If the decoder uses the channels independently; i.e., 
Ui = 7 f(yi) for i = 1 ,,n, then the optimal cost of 
the encoder will be 

J*’ e = __ min^ E |"||m — u — b\\ 2 + A||cc || 2 

x=f e (rh) L 
n 

= _ min V E[(mj - 7 f{Vi) ~ h) 2 + Ax 2 ] 

x=7 e (fn) z ' 

2—1 

n 

= - mi n E[(mi - 7 f(yi) - b z ) 2 + Ax 2 ] . 

z ' x= 7 e (m) 

2=1 

Since, j/j — Xi +Wi for each i = 1,... ,n, the optimal 
encoder also uses the channels independently; i.e., 
Xi = 7 ® (mj) for i = 1 ,..., n. 

2) Similarly, if the encoder uses the channels indepen¬ 
dently; i.e., Xi = 7 f(nrii) for i = 1 then the 

optimal cost of the decoder will be 

J*’ d = min E[||m —w|| 2 ] 

a=i d {y) 

n 

= min E[(mj — Ui ) 2 ] . 

~f u=i d (y) 

Since, yi = 7 f{mi) 4- Wi for each i = 1 ,n, the 
optimal decoder will also use channels independently; 
i.e., m = ■yfiyi) for i = 1,... ,n. 

Thus, we have, in each dimension i (i = 1,..., n); 

« the source Mj is a zero-mean Gaussian with variance 

« the channel has the Gaussian noise Wi with zero-mean 
and variance cr^., 

« the encoder’s goal is to find the optimal policy which 
minimizes its cost min Xi= 7 e( T7X .) E[(mi — m — hi) 2 + 
Ax 2 ], 

• the decoder’s goal is to find the optimal policy which 
minimizes its cost min„. = 7 <i( yi ) E[(m* — Ui ) 2 ]. 

For each dimension, the informative affine equilibrium exists 
if A < am-f&w-- F° r the multidimensional setup, the 
existence of the informative equilibrium in at least one di¬ 
mension implies the existence of the informative equilibrium 
for the whole sytem. Hence, it is sufficient that the inequality 
A < <Jm / a w i s va hd for at least one dimension. As a 
result, the condition for the existence of the informative 

<7 2 a 2 

affine equilibrium becomes A < max{Mp-,..., ■ 


Note that, from (fT71) and (fl 8 l) . by assuming |A| 7 ^ 0, we 
have XAY, a jA t = K T KT,^ which is equivalent to 

X(A t )~ 1 E k} A t = (. K t K + A I){K t K + AI)E # . (20) 


Remark 5.2: Assuming all channels are informative, i.e., 
|A| 7 ^ 0 , we make the following observations. 

• If the source is i.i.d.; i.e., = cf^I, then d20t 

becomes 

A (A T )- 1 a 2 m IA T = ( K t K + XI){K t K + A/)E # 

=► Aa^(E # )- 1 = ( K t K + X I)(K t K + XI) 

=> A^(E ^)- 1 > A 2 / 

=* A/< o* (E^)" 1 . 

This result implies that A must satisfy the inequality 
XI < cr^ 1 (E ^) _1 for the i.i.d. source; otherwise, there 
must be at least one non-informative channel; i.e., A 
must be 0 . 

• If the channel noise is i.i.d.; i.e., E^ = cr 2 /, (since E^ 
is real-symmetric, it has the eigenvalue decomposition 
as Ejg = QAQ t ), then ( l20t becomes 

A(A t )- 1 E a 1 A t = (K t I< + XI)(K t K + X I)a 2 w I 

=4 \{A T )~ 1 QAQ T A T = (K t K + XI)(K t K + XI) 

=> (A t )- 1 QAQ t A t > Xo 2 w . 


This result implies that for each eigenvalue Xg of E g , 
A must satisfy A < Xjg/at^ for the i.i.d. channel noise; 
otherwise, there must be at least one non-informative 
channel; i.e., |M| must be 0 . 

• For the general case, recall the Minkowski determinant 
theorem, |A + B'fl n > IAI 1 /” + \B\ 1 / n which holds 
for any non-negative n x n Hermitian matrix A and 
B. This implies \A + B\ > |A| + |H|. By using this 
inequality and (ITSi) . 

Ml = 1^1 < l g l 

1 1 \K T K + XI\ ~ \K\ 2 + X n ' 


Assuming |A| 7 ^ 0, recall the equality AAE^M T = 
K T KA.g. Taking the determinant of both sides. 


1*1 2 




I^mI < A" 


1*1 


< A’ 


\Kl 

X 2n 


M I 


A” < 


|AT | 2 + A” 

E, 


I s 


Ml 


J M I 


-‘W I 


The result can be interpreted as follows: If A > 

(jifl ) 1 then = |*| = 0 in the equilibrium; 
i.e., there must be at least one non-informative channel. 


B. An information theoretic lower bound on the encoder cost 
and the existence of informative equilibria 

We end the section with an information theoretic lower 
bound for the encoder cost; this serves us also to obtain 
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condition for the existence of an informative equilibrium. 
Let e = fh — u = rh — E[m|y], then we have E g = E[ee‘ r ] = 
E[(m — E[rh\y\)(fh — E[m|i/|) T ]. From the information 
theoretic inequalities; 

I (rh; y) = h{rh ) — h(fh\y) = h(fh) — h(rh — E[m|y] \y) 

> h(rh) — h{fh — E[m|j/]) 

> him) - i k>g 2 ((27re)"|Eg-|) 

= \ log 2 ((27re) 7l |E*|) - i log 2 ((27re) n |Eg-|) 

= 7^ l°g2(l^m|/|^e|) • 

Also from the rate-distortion theorem, the data processing 
theorem and the channel capacity theorem: 


R(D) < min I(rh;u) < I(rh;u) 

/(n|m):E[||m— u\\ 2 ]<D 

< I{x;y) < max I{x;y) < C(P). 

/(x):E[|H| 2 ]<P 

If we combine these, we obtain the following: 

|£e| > \E^\2~ 2R ^ > |E^|2- 2/ ^ ;S ) 

> |E*|2- 27 ^ > |E*|2- 2C ( p ). (21) 

Now consider the following: 


E[||m - u|| 2 ] = E[||e|| 2 ] = trEg- > 


C b) 

> n 


(|£e|) 


1/n (c) 




i=1 


>-2 C(P) 


1/n 


1/n 


( 22 ) 


Here, (a) follows from the inequality for the arithmetic 
and geometric mean where E f -(z, i) stands for /th diagonal 
element of E g, (b) follows from the Hadamard inequality 
(since Eg- is a positive semi-definite matrix), and (c) follows 
from (|2TT) . Now we will rewrite |39j Eq. (9.166)] which 
presents the capacity of the additive colored Gaussian noise 
channel with typo corrected: 

max(^ — Xi, 0)' 


C(P) = i£!>0fe(i + 


i= 1 
121 


Xi 


where P = E[||x|| 2 ], Ai,A 2 ,...,A n are the eigenvalues of 
E^j and v is chosen so that ^" =1 max(z/ — A*, 0) = nP. 

Then we can obtain the following: 


2 -2C(P) = 2 


-2^ E \ log 2 ( !+ 


max(i/-A^ ,0) 


n 

n( 

i=l 

n 

n( 

i=1 


1 + 


max(^ — Xi, 0) ^ _1 / n 

Ai 


max(j/, Aj) ^ ~ 1 /' n 

~Xi ) 

(nr = iA,) i/n 

(nr=i max (^ Ai)) i/n 
(i^i) 1/n 
" (P + Eti^t) 

1/n 




Here, (a) holds, since our assumption Y^i=i max (v — 
Aj,0) = nP implies YJL\ lnax (' 7 - A,;) = nP + Yh=i At 
and (n”=i max (^ Aj)) 1 /" < J2i=i max (V, h)/n = P + 
1 Aj/n holds by the inequality for the arithmetic and 
geometric mean. If we insert (f23l> to (l22l) . 



= n(|E 7S |) 1 /”(|E !i? |) 1/ ” 2 (p+^) 1/n . (24) 

The encoder costs reduces to J e = E[||m— m|| 2 ]-|-AE[||x|| 2 ]+ 
||6|| 2 since the decoder always chooses u = E[m|y]. Then, 

by (El. 

J e = ||&|| 2 + AE[||f|| 2 ]+E[||m-tI|| 2 ] 

> H&ll 2 + AP + ndE^D^dS^I) 1 /” 2 (p + _1/ 

(25) 

The minimizer of this function can be found by the local 
perturbation condition: 

A-dE^D^dEtfl) 1 /" 2 (P + ”" _1 = 0 

=> A = (lE^D^dE^I) 1 /” 2 (P + ~" _1 

< (l^m|) 1/n (|Eu;|) 1/n2 ((|E^j|) 1/n ) " _1 
= (|S™|) 1/ ”(|E 1 ,|)- 1/ ”. 

Here, (a) follows from the nonnegativeness of P and 
the inequality for the arithmetic and geometric mean and 
the Hadamard inequality, similar to d22l) . Hence, if A < 
(lE^D^dEtuI) -1 /", ^ lower bound is minimized at a 
nonzero P value, but if A > (|E,^|) 1 / n (|E. ! 2 |) _1 /", t ^ le 
minimizer P becomes zero. Finally, if channels and source 
are assumed to be i.i.d.; i.e., E,^ = cr^I and E^; = cr^I 
where I is n x n identity matrix, and the encoder and the 
decoder use linear policies, then (l25l > becomes tight and can 
be interpreted as follows: If A > (|E^,|) 1 / n (|E^|) _1 / n = 
cr 2 ,/cr 2 ,, then (f25l) is minimized at P = 0; that is, the encoder 
does not signal any output. Hence, the encoder engages in an 
non-informative equilibrium and the minimum cost becomes 
E [||m|| 2 ] +1|6|| 2 at this non-informative equilibrium. Recall 
that this is analogous to the analysis in the scalar setup ([Sj. 

VI. Concluding Remarks 

For a strategic information transmission problem under 
quadratic criteria with a non-zero bias term leading to a 
mismatch in the encoder and the decoder objective functions, 
Nash and Stackelberg equilibria have been investigated in a 
number of setups. It has been proven that for any scalar 
source, the quantized nature of Nash equilibrium policies 
hold, whereas all Stackelberg equilibrium policies are fully 
informative. Further, it has been shown that the Nash equi¬ 
librium policies may be non-discrete and even linear for a 
multi-dimensional cheap talk problem, unlike the scalar case. 
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The additive noisy channel setup with Gaussian statistics has 
also been studied, such a case leads to a signaling game due 
to the communication constraints in the transmission. Condi¬ 
tions for the existence of affine Nash equilibrium policies as 
well as general informative Nash equilibria are presented 
for both the scalar and multi-dimensional setups. Lastly, 
we proved that the only equilibrium in the Stackelberg 
noisy setup is the linear equilibrium. Our findings provide 
further conditions on when affine policies may be optimal 
in decentralized multi-criteria control problems and lead to 
conditions for the presence of active information transmis¬ 
sion in strategic environments. Recently, we have extended 
our analysis in this paper to study dynamic signaling games 

Mi- 
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